Update CI to Julia version to 1.12.0 #4836
base: main
Conversation
|
I am very interested in this. Let's hope it works and we can move on from Julia 1.10. |
|
I am disabling the Reactant tests for the moment to check whether the rest works. |
|
If docs still break on the |
|
It seems that we are hitting the same NaN issue in the internal tide example.
The ghosts of the past still haunt us... |
|
Apparently also |
|
If I run the example locally, it works. Why would it error on CI? Do we have a way to reproduce this error locally? |
One thing to try might be to run the example locally and on CI using the exact same Manifest.toml if possible. We can commit a Manifest.toml to this branch for debugging. I can't think of which dependency would lead to such a big difference but it's one thing we can control for. |
|
From the Julia v1.11 chat I recall that the error was showing up only on Unix, not on Mac? |
This comment was marked as resolved.
|
The plot thickens: it works correctly in Julia v1.12 on Ampere eMAG (aarch64) with AlmaLinux 8.10 as operating system, which rules out an operating system difference. aarch64 is also the architecture on macOS, so I'm starting to suspect there's an architecture dependence. Can someone point me to the operation performed on the |
Nice work so far though!! The entire time-step is a complex chain of operations. I do think a good start is to save all fields at every time-step. We may find that differences arise in one field versus another. Note that the NaNChecker checks |
|
To save every iteration, change this line: Oceananigans.jl/examples/internal_tide.jl, Line 170 in ea25179
to
The difference should arise in the very first time-step? We could compare those. It seems annoyingly laborious to do this across architectures, but maybe @giordano you have good ideas for how to do this efficiently. |
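One possible way to make the cross-architecture comparison less laborious is sketched below. It assumes the fields saved on each machine have already been loaded into plain Julia arrays, one array per saved iteration; the names `first_mismatch` and `compare_snapshots` are hypothetical helpers written for this sketch, not part of Oceananigans. The idea is only to report the first iteration and grid index at which two runs disagree, counting a NaN on one side only as a disagreement.

```julia
# Hypothetical helper: return the first index where two snapshots disagree,
# or `nothing` if they match everywhere within the given tolerances.
function first_mismatch(a::AbstractArray, b::AbstractArray; rtol=0.0, atol=0.0)
    size(a) == size(b) || throw(DimensionMismatch("snapshots have different sizes"))
    for I in CartesianIndices(a)
        x, y = a[I], b[I]
        if isnan(x) != isnan(y)
            # NaN on one architecture but not on the other
            return I
        elseif !isnan(x) && !isapprox(x, y; rtol=rtol, atol=atol)
            # Both values are non-NaN but differ by more than the tolerance
            return I
        end
    end
    return nothing
end

# Hypothetical helper: walk two sequences of snapshots (one array per saved
# iteration) and report where they first diverge.
function compare_snapshots(snapshots_a, snapshots_b; kwargs...)
    for (n, (a, b)) in enumerate(zip(snapshots_a, snapshots_b))
        I = first_mismatch(a, b; kwargs...)
        if I !== nothing
            return (iteration = n, index = I, values = (a[I], b[I]))
        end
    end
    return nothing
end

# Toy data standing in for the same field saved on two architectures.
run_a = [zeros(2, 2), ones(2, 2)]
run_b = [zeros(2, 2), [1.0 1.0; NaN 1.0]]

result = compare_snapshots(run_a, run_b)
println(result)
```

With the default zero tolerances this flags any bitwise-visible difference; passing a small `rtol` would instead ignore round-off-level discrepancies and only surface larger divergences.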
|
Shall we just move the tests to Julia 1.11 then, until Reactant and Enzyme are ready? |
|
How easy would it be to run Reactant/Enzyme tests only with 1.11 in Buildkite? It shouldn't be hard in GitHub Actions, but I'm less familiar with Buildkite. |
|
I can try giving it a go. We can actually add another pipeline, and probably need a new queue. @ali-ramadhan would you know how to set this up on nautilus if I separate out the tests in a new pipeline? |
|
I think I have managed to separate them out. |
This comment was marked as resolved.
Co-authored-by: Mosè Giordano <[email protected]>
This comment was marked as resolved.
67515b7 to e81db39
e81db39 to 94bc181
|
Alright, I believe I found the right incantation in 94bc181. The summary is
My solution was to
|
|
The docs built and the internal tide example looks great! There seem to be some errors in the GPU abstract operations CI: https://buildkite.com/clima/oceananigans/builds/26558#019a515f-fc97-4a52-96e9-a8a5894dffdd/7-53 If that smooths out, I'll work on bringing the doctests back in! |